Meaning Matters: Senses of Words are More Informative than Words for Cross-domain Sentiment Analysis
نویسندگان
چکیده
Getting labeled data in each domain is always an expensive and a time consuming task. Hence, cross-domain sentiment analysis has emerged as a demanding research area where a labeled source domain facilitates classifier for an unlabeled target domain. However, cross-domain sentiment analysis is still a challenging task because of the differences across domains. A word which is used with positive polarity in the source domain may bear negative polarity in the target domain or vice versa. In addition, a word which is used very scarcely in the source domain may have high impact in the target domain for sentiment classification. Due to these differences across domains cross-domain sentiment analysis suffers from negative transfer. In this paper, we propose that senses of words in place of words help to overcome the differences across domains, which in turn leads to improvement in cross-domain sentiment classification accuracy. Results show that senses of words provide a better sentiment classifier in the unlabeled target domain in comparison to words for 12 pairs of source and target domains.
منابع مشابه
Study of Significance Tests with respect to Sentiment Analysis
Sentiment Analysis (SA) has grown tremendously over the decade. More and more sophisticated techniques are built to tackle the problem. The evolution of methods has been on several different dimensions. Some of these are complexity of algorithm, the knowledge source used, etc. The SA task is to predict the sentiment orientation of a text (document/para/sentence) by analyzing the polarity of wor...
متن کاملCreating a General Russian Sentiment Lexicon
The paper describes the new Russian sentiment lexicon RuSentiLex. The lexicon was gathered from several sources: opinionated words from domain-oriented Russian sentiment vocabularies, slang and curse words extracted from Twitter, objective words with positive or negative connotations from a news collection. The words in the lexicon having different sentiment orientations in specific senses are ...
متن کاملAutomatically Identifying Changes in the Semantic Orientation of Words
The meanings of words are not fixed but in fact undergo change, with new word senses arising and established senses taking on new aspects of meaning or falling out of usage. Two types of semantic change are amelioration and pejoration; in these processes a word sense changes to become more positive or negative, respectively. In this first computational study of amelioration and pejoration we ad...
متن کاملA Comparison among Significance Tests and Other Feature Building Methods for Sentiment Analysis: A First Study
Words that participate in the sentiment (positive or negative) classification decision are known as significant words for sentiment classification. Identification of such significant words as features from the corpus reduces the amount of irrelevant information in the feature set under supervised sentiment classification settings. In this paper, we conceptually study and compare various types o...
متن کاملDomain Sentiment Matters: A Two Stage Sentiment Analyzer
There are words that change its polarity from domain to domain. For example, the word deadly is of positive polarity in the cricket domain as in “Shane Warne is a ‘deadly’ leg spinner”. However, ‘I witnessed a deadly accident’ carries negative polarity and going by the sentiment in cricket domain will be misleading. In addition to this, there exist domainspecific words, which have the same pola...
متن کامل